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Abstract 



Oh Discretizing variational principles, as opposed to discretizing differential equations, leads to discrete-time analogues 

of mechanics, and, systematically, to geometric numerical integrators. The phase space of such variational discretiza- 
tions is often the set of configuration pairs, analogously corresponding to initial and terminal points of a tangent 
a^ vectors. We develop alternative discrete analogues of tangent bundles, by extending tangent vectors to finite curve 

segments, one curve segment for each tangent vector Towards flexible, high order numerical integrators, we use these 
^^ discrete tangent bundles as phase spaces for discretizations of the variational principles of Lagrangian systems, up to 

the generality of nonholonomic mechanical systems with nonlinear constraints. We obtain a self-contained and trans- 
parent development, where regularity, equations of motion, symmetry and momentum, and structure preservation, all 
have natural expressions. 



1 Introduction 

A discretization of a Lagrangian system L : TQ — > R consists of 



1^ 

o 

00 

/-i^ 1 . a time step h > 0; 

J> 2. the discrete phase space (3 X (3 = {(^^,^")), thought of as a discrete version of the tangent bundle ^(S; 



3. a discrete Lagrangian L^ : (3x(3 — > R, obtained by approximately integrating L over an appropriate interpolation 



^ from q to q'^ 



The discrete Lagrangian L/, and the discrete phase space QxQ together define a discrete Lagrangian system. Evolutions 
are sequences qt e Q, k - I,. . . ,N that are critical points of the discrete action S , defined by 



S = 2_^Lh{qk,qk-\), 



k=\ 
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subject to the constraint qo and q^ constant. As is easily shown, q^ is an evolution if and only if it satisfies the discrete 
Euler-Lagrange equation 

dLh ^ dLh 

^—(qk,qk-i) + ^—(qk+i,qk)^0- (l) 

oq^ oq 

Lagrangian discretizations lead to (implicit, symplectic, and momentum conserving) numerical methods because 
Equation ([TJ can be used to advance through time h by stepping states iqk,qk-i) to {qk+i,qk)- 

Such Lagrangian discretizations are towards discrete Lagrangian models that reflect physical reality so well that 
they have a stature with continuous Lagrangian models. States of a continuous model — such as values of the indepen- 
dent variables of a differential equation — and states of a discrete model — such as the pairs of configurations used in 
the map implied by ([T]i, can both serve as abstract representations of system states. Most important is not the particular 
representation of the states, but whether the states evolve as the physical system does. If errors, in either a discrete or 
a continuous model, are below measurement errors of a physical system, and neither the discrete nor the continuous 
model violates fundamental physical principles to that accuracy, then both models have a similar stature. Lagrangian 
discretizations also provide variational discrete analogues of continuous Lagrangian systems. They are of interest in 
themselves, and they provide a framework for the analysis, understanding, and development of geometric integration 
algorithms (,12 J for Lagrangian systems as purely mathematical objects. For more details, see (l7l [T0lfTn[T6l[T7ll25]) . 

In this article, we further develop discretizations of Lagrangian systems. We refer to the discretizations outlined 
above as Moser-Veselov (MV) discretizations. Our discrete Lagrangian systems replace the MV discrete phase space 
QxQ with a discrete phase space 'V consisting of curve segments in Q which one-to-one correspond with elements 
of TQ. To any such discrete Lagrangian system, there is naturally associated a isomorphic MV discrete Lagrangian 
system, obtained by identifying our curve segments with their boundaries. Viewing tangent bundles as curve segments 
is generally consistent with viewing discretizations in general as attaching to a manifold finite rather than infinitesimal 
objects (5 ). 

Systematically using curve segments provides theoretical flexibility and geometric clarity. For example, our curve 
segments can be naturally shrunk, and this helps to analyze limits where the time step tends to zero. The interpolating 
curves of MV discretizations are obtained implicitly from boundary value problems, with boundary values the two 
configurations of the MV discrete states. We finesse this implicit dependence by directly using the interpolating 
curves. We achieve a self contained variational theory, which does not depend on discrete versions of the Legendre 
transform, nor on any canonical formalism on the cotangent bundle. We show that the entire development extends 
to nonholonomic systems with nonlinear constraints. We extend the curvature conditions for holonomic subsystems 
of nonholonomic systems (.19J to the discrete case, and prove the nonholonomic momentum equation (|4l [Til in our 
context. As well, we show how our discrete Lagrangian systems speciahze to discrete holonomic systems. 

Some notations: Unless otherwise noted, objects are sufficiently smooth to permit the required operations. If Al is 
a manifold and v™, w„, e TAi, then define 



d 

vert,, Wa = — 

" ^ dt 



{Vq + tWq). 
=0 



If ;r: £■ — > A1 is a vector bundle, and z e Ti)^,E i.e. if z is a tangent vector at the zero section, then we denote the 
horizontal and vertical parts of z by horz e T„,M and vertz e £„,, respectively. We denote the fiber dimension of a 
fiber bundle by fdim and the fiber codimension of a subbundle by cofdim. To reduce double subscripts, we sometimes 
use the functional notation x{k) instead of Xk for a sequence. If A is a set, then we will use the notation A{M,N] for 
the sequences in v{k) e A, k - M, . . . ,N. \f Q acts smoothly on a manifold M then we denote the Lie algebra of ^ by 
g, and the infinitesimal generator of^e q atm e Ai by 



d 



exp(^t)m. 

1=0 



Assembling these into a vector field gives f ai('w) - ^'W- 



2 Discretizations of Tangent Bundles 




1 1-> il/(h, t, m) 



Let A1 be a manifold and m e A1. Two curves c: ia,b) — » A1 and c: {a,b) — > Al with e {a,b) and e {a,b) are 
tangent at m if 1) c(0) = c(0) - m; and 2) 0c(f)-0c(f) = (9(f^)inany chart of A1 with domain including m. Tangency 
at m is an equivalence relation, and the tangent space T,„Ai at m e M. may be defined (2) as the set of equivalence 
classes of curves at m. Our discretizations of Lagrangian systems depend on the development of a discretization of a 
tangent bundle TM. as assignments of curve segments in M. to tangent vectors of h\. We will require a parameter h 
such that TM. is obtained in the limit h — > 0^. So, we posit a map \l/{h, t, m), with values in A4, and obtain the curve 
segments 1 1-> t//(h, t, m): 

Definition 1. A C'' discretization of TAi, k > I, is a tuple (tfr, a^ , a ), where 



tj/: U QWxM^ M, a^ ; [0, a) ^ R>o, 
are such that 

1. tfr is continuous, U is open, and [0] x {Oj x A1 £ U; 

2. a^,a^ areC', anda^{h) - a^ih) - h; 

dip 

3. i//(h,0,v„,) — m, and ^— (/i,0, v,„) = v„,; 

ot 

4. the boundary maps defined by 

dl{v,„) = ^{h,a'{h),Vm), 

are C* in (h, v„,), and 

£/„+,..+ d 

— d,,(v„,)^a v,n, — 

an h=o an h=o 



a': [Q,a) 



^<07 



dt(vm) = ilf{h,a^{h),Vm), 



d,, (Vm) 



(2) 



(3) 



where 



an 



"- = S<»'^ 



Remark 1. Putting h - in a^(h) -a (h) - h gives q'^(O) -a (0) = because a* >0 and c < 0. Differentiating 
a^{h) - a^(h) = hath-0 gives a^ - a" = 1. If (/» is C', then Assumptions ([3| are superfluous, since 



d_ 
dh 



d_ 

/i=o dh 



dl(vm) = — 



i//(h, a^(h), v„) = — 
/i=o an 



i/r(/z, 0, v,n) + — 
h=a an 



\l/{0,a^{h),v,n) - a^v„, 

h=0 



and similarly with 5,^ . The definition allows t// to be only piecewise smooth in h, t. 

For all v,„ e M, the set {(/j,f) : {h,t,v„^ e {/} is open and contains h - Q,t - Q, and so contains the set 
{(/i, t) : h e [0, /?), a"(/i) < f < a^C/j)} for some < b < a. So assigned to every v,„ e TWl and small enough /i > is 
the curve segment 

t H-> i/r(/j, f, v„), q;~(/!) < f < a^(h), 

dil/ ^ _ 

which, since Mh, 0, v„) — m and -^^(h, 0, Vm) = v^, is a curve at m which is tangent to v,„. A discretization (iff, a , a ) 

ot 
assigns to every v,„ a curve segment that can be thought of as a translational step like hv„. 

Example 1. Let M = R^, < y < 1, a'^(h) = yh, a'{h) = -{I - y)h, and iff(t, h, v,„) = m + tv. More generally, let X 
be any second order vector field on A1, < y < 1 and define iff^{h, t, Vg) = TMF^(vq) where Ff is the flow of X and 
Tyvi : TM — > Al is the canonical projection, if/^ and a^ , a^ is the X-discretization with bias (a^, a^). 

Example 2. Let X be a vector field on a manifold M. A one-step numerical method for X is a map tp: U Q [0, oo) x 
At — > Al such that 

1. {0)xMc U; 

2. (p(Q, m) - m for all m e M; 

(p{t, m) — X(m). 



dt 



If 1^ is a one-step numerical method for a second order vector field on TM then ^(h, t, v^) = tm fit, Vq) is a discretiza- 
tion of TM. 

Generally, when we speak of a discretization we mean a. family of discrete objects parametrized by /i € R, such 
that a continuous target is approached as /i ^ 0. A discrete object is an instance of a discretization, obtained by fixing 
hto a particular value and possibly dropping data not required to make operational the discrete representation of the 
continuous target. For tangent bundles, we choose the transition from discretization to discrete as the juncture at which 
we drop the curve segments in discretizations of tangent bundles, retaining only their endpoints: 

Definition 2. Let M be a manifold. A discrete tangent bundle of M is a tuple {'V, d^, 5 ), where 'V is a manifold, 
dim'V = 2dimM andd^: 'V ^ M and d: 'V -> M satisfy 

1 . d^ and 5" are submersions such that ker Td"^ n ker TO" = 0; and 

2. for all m e M, the backward fiber 'V^ = (5^)"'(m) and the forward fiber 'V^ = (5")"'(m) meet in exactly one 
point, denoted Om. 

The discrete zero section is 0<y = (5"^)"' A(A1 x M), where A(M x M) is the diagonal of M x M. 



Remark 2. Let 5* : ^ ^ M X M be defined by d'=(p) = {d^(p),d-(p)). Item [T] of Definition |2] implies that 7,5^ 
is a linear isomorphism for all v e 'V, and therefore d"^ is a local diffeomorphism. Also, Item [2] of Definition [2] 
implies that d'^ is bijective from 0^/ to A(M x A1) so that the local diffeomorphism d'^ is a diffeomorphism of 0^/ to 
A(A1 X A1). O'v is a closed submanifold of 'V because A(A1 x A1) is a closed submanifold of Al x A1, and Theorem 1 
(semiglobal inverse function theorem) of (8 ) provides open neighborhoods U of Qry and V of A(A1 x A1) for which 5"^ 
is a diffeomorphism. 

Remark 3. Our discrete tangent bundles are similar to the groupoid based constructions of discrete phase spaces for 
Lagrangian systems in ('24'). Indeed, to any discrete tangent bundle 'V there is an associated Lie groupoid consisting of 
1) sequences v^ in 'V which satisfy 5^(vt) - 5"(vi+i), and the reverses of these, 2) units the elements 0„,, and 3) source 
and target maps d^ and 5 . Some of the constructions below are the same as those found in the groupoid context. 
One can regard discrete tangent bundles of A1 to be groupoids over A1 for which the set of irreducible elements 'V is 
a manifold which satisfies Item [T] of Definition |2] To the extent of this article, the algebraic structure of the groupoid 
seems to generate more ambiguity than it does clarity. For example, starting as in Definition l2] with a tangent vector 
V e'V, one might include the formal reverse of v in order to have its groupoid inverse. There will generally be another, 
different, element in 'V with the same source and target as that formal reverse. And, the product of that with the 
original v is a two element sequence that starts and ends at the same place of M, but it is not the same as the discrete 
zero vector 

Remark 4. Given any manifold M, the tuple {M x M, c^t^^Ai' '^aixai^ ^^ ^ discrete tangent bundle, where 

M X M = {(m+,m")), 5^^^(m+,m") = m' , d^iy^M^"^^^"^'^ = "^^ ■ 

Thus the usual MV discretizations of the tangent bundle are special cases of Definition|2] For a discrete tangent bundle 
CV, d^, 5 ), the map d'^ is typically a diffeomorphism in the region of interest, so one can in principle, using d'^, replace 
any discrete tangent bundle (Al, d^, d ) as in Definitionl2]by its image in A4 x M, dispense with the maps 5^ and d^, 
and use the corresponding MV discrete tangent bundle: 

'V > MxM 



M 

However, the freedom of including d^ and 5" in the definition of a discrete tangent bundle, and the abstraction of the 
discrete tangent vectors as elements of a manifold 'V, is helpful and clarifying. 

Since discretizations are to be families of discrete analogues, it is necessary to show that a discretization of a 
tangent bundle gives discrete tangent bundles for sufficiently small h (Proposition [T] below). This is not immediate 
because there is a singularity at /z = 0. To see the problem, consider the example M = M.'^, with discretization 
if/Qi, t, (m, v)) = m + tv + O(h^). To show ItemSof Definitional it is sufficient to show the 5^ is a diffeomorphism near 
/? = i.e. that the equations 

m" - m + a'(h)v + O(h^), m^ - m + a^(h)v + O(h^), (4) 

can be solved uniquely and smoothly for (m, v) in terms of (m^, m ). Because there is no a priori knowledge of the 
details of the O(h^) term, the proof of that has to be perturbative from h - Q. However, when h - Q, Equations (|4]) 
cannot be solved at for v because they reduce to m^ - m, m^ = m. Replacing v with v = v/h would solve the problem 



but would require, at later stages, smoothness assumptions in vjh, and would make the subsequent development 
awkward and less general. But, replacing (m^, m^) with the new variables {m,z) defined by 

(5) 
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m^ 


— 


m 


Til 




2 


^ 




~h 




converts 


Equations Q to 
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-m + 


a^Qi) + a 


-(h) 


V + 


0( 


'hh 



..„,..„ z^v + 0{h). (6) 

We remark that a division by h has reduced the order of the trailing term in the second equation. At h = 0, Equations (|6]l 
are 



which has solution m = ill, v = z. By the implicit function theorem. Equation (|6| may be solved for m, v in terms of 
ill, z for sufficiently small h, and hence, through Equation (|5]l, for m, v in terms of (m^, m^), as required. To obtain a 
result near h -0 which is valid for m, v near the whole zero section O(T'AI) i.e. local along h but global along 0(TAi), 
we again make use of Theorem 1 (semiglobal inverse function theorem) of (8). 

Proposition 1. Let (up, a^ ,a^) be a discretization of the tangent bundle of M and let Mo <Z Aibe a relatively compact 
open set. Then there is an a > such that, for all h e (0, a) there is on open set "Vh £ TM such that 

1. the tuple CYi,, ^/J^, 5p is a discrete tangent bundle of Mq; 

2. dj^ is a dijfeomorphism from 'V/, to an open neighborhood of A(A1o X Mq). 

Moreover, for all v,„ e T,„Mo, there is a sufficiently small h and 'V/, such that v,„ e 'V/,. 

Given a discretization of a tangent bundle, one can obtain, by choosing h small enough, a discrete tangent bundle 
which provides discrete analogues of arbitrarily large tangent vectors. In physical contexts, this implies that arbitrarily 
high velocities can be accommodated in the discrete systems by using sufficiently small time steps. 

Proof of PropositionU\ In the generic manifold context the construct m^ - m^ of Equation (|5]l is unavailable, but 
it can be replaced by the fibers of a tubular neighborhood of the diagonal A(A1 x A1). The vector bundle E = 
[{v,n, -v,„) : Vm € TM] is a normal bundle to the diagonal A(A4 x M) of My. M, so there is a tubular neighborhood 
^■.W'^ ciE-^ W^^-^ c M X M. 

The diffeomorphism ( may be chosen so that T^ is the identity on the zero section 0(£') with respect to the 
horizontal- vertical decomposition i.e. for all w e 2o,,„ ,„,£■, 

T^(w) - hor w + vert w. (7) 

Any (v,^j, v,7j) e T(„^„){M x M) may be decomposed as 

(v+ v") = (-(v+ + v") -(v+ + v")l + |-(v+ - v") -(v" - v+) 

If r^^'Cv^,, v,7,) = w then this is the unique decomposition of T'^(w) according to the direct sum JJ;,,, „,)(A1xA^) ©£■(„„,)■ 
By Equation (|7]l, hor w + vert w is also this decomposition of TC,(w), so 

vertrr'(v;, v-J = (i(y; - y,;), i(v;, - y^)] . (8) 



Consider the map <^: [(h, v,„) : d^(v„) e W^""^} -^RxEhy 



<p(h, v„) = 



("•(y-y))- '- 



Using Equations Q, and ([8]), 
d 



vert 
dh 






so (^ is smooth by ^, Proposition 1 . i^ is a local diffeomorphism at any (0, v,„) since the derivative of (f is nonsingular 
there, and i^ is a diffeomorphism from {0) x TM to {0) x £, so ^ is a diffeomorphism from some open neighborhood 
of (0) X TM to some open neighborhood of jO) x £ ((8 ), Theorem 1). The domain of the map 

ip{h, (v„, -v,„)) = {h, ^{hVm, -hvm)) 

includes {0} x £, ^ is a diffeomorphism except at h - 0, and 

{h,d^{v,n)) = ip(p(h,v„,). 

Thus there are open neighborhoods {/ 2 {0) x TM and W 2 {0) x A(M x M) such that (h,v„) h-> {h,df^(v,„)) is a 
diffeomorphism from U \ ({0) x TM) to W \ ({0} x (M x M)). 

Given a relatively compact open Mo Q M, choose a > such that [0, a) x A{Mo x Mo) £ W. Assume < h < a 
and define 

^n = { v„, : (h, y„,) eU}n (5,t)"'(Mo) n (5,r'(Mo). 

df^ is a diffeomorphism on ^/, since (h,Vm) i-^ (/z, 5^(vm)) is a diffeomorphism. Also, {h,(mo,mo)) € W for any 
OTo € Alo, so <5^(v',„) = mo and ^^^^^(vm) - mo where v,„ is defined by (jo(/i, v„,) = {h, (mo, mo)), hence 5,^ and dj^ are onto 
A^o- By continuity, given any v„, e TMq, h can be chosen so small that d^(Vm) e Mo and d^{Vm) e AIoj and so small 
that {h, v,n) e C/- Thus h can be chosen so small that v„, e 'V/,. n 

As an aside, we get the following Corollary, which, given a second order vector field, is obtained by applying the 
proof of Proposition fl] (particularly the construction of ip) to the X discretization with bias a^{h) - 0, a^{h) - h. 

Corollary 2. Let X be aC^ second order vector field on TM, k > 2. Then there are open neighborhoods U 2 { ) X TM 
and W 2 {0) X A{M x M) such that (t,v) h^ {t,TMF^{v),TM{v)) is a C*"' dijfeomorphism from t/ \ (0) x TM to 
y \ {0) X A(M X M). 

Corollary |2]is important when constructing classical generating functions of type 1 for Lagrangian systems L : (3 ^ R. 
These are functions on (3 x<3 which are defined as integrals of the Lagrangian L over solutions with specified endpoints 
i.e. the classical action as a function of endpoints. To well define the generating function using the flow of the 
Lagrangian vector field, one should construct the map A,(g'2, ^i) from (3 x (3 to 7'<3 that returns the initial velocity at 
qi that evolves to ^2 over time interval t. The generating function is then 



S,{q2,qx)= f F^^iMq2,qi))ds 
Jo 



where Xe is the Euler-Lagrange vector field and F^'^ is its flow. The map A{q2,qi) cannot be straightforwardly 
constructed using the implicit function theorem (as is attempted for example in (2 )) because of a singularity at f = 0: 
infinite velocity is required to traverse from qi to q2 in zero time. But the map A(q2,q\) is easily extracted from 
Corollaryl2] The MV discrete 'exact' Lagrangian (fT6t is the same as the the type 1 generating function and both sufifer 
the same singularity. 

In the context of (3 = R^ with iff(h, t, v^) = x + tv and bias a^{h) = 0, a^(h) = h, one has 

{d^y'^inT ,m^) - {rrr,(ny -rrC)/h). 

Thus the inverse of 5^ may be given the interpretation of a difference quotient. This is used in Definition [31 to define 
the discrete derivative of a sequence in A1. That is important because it gives the definition of a discrete first order 



sequence in TM, which is crucial to the discrete variation principle LdAJj of Section 3.2 
Definition 3. 

1. Ifnik is a sequence in M, then a discrete derivative of nik is a sequence m'^^e'V such that d'^im'i^) = (m^+i, m^). 

2. A sequence v^. e 'V is called first order ifvk = m^ for some sequence m^ e M. 
A sequence m^ is first order if and only if 

5+(yi) = 5-(y,+i), (9) 

because the derivative of every sequence m,t satisfies Equation ([9|, and every sequence v^ satisfying Equation (|9]l is 
the derivative of 

niQ = d'(vo), nil = d'(vi), ...,mk= d'(vk), . . ■,mN-i = 5"(vw-i), m^ = d^(vN-i). 

By Remark I2] or Proposition [T] if the maps d^,d^ arise from a discretization, the discrete derivative of m^ is unique 
as long as the pairs (mi,+i,mi,) lie sufficiently close to the diagonal and the sequence values v^ are restricted to be 
sufficiently near the zero section. 

Let CV, d^, c? ) be a discrete tangent bundle of Al. Define the backward vertical bundle by 

vert+ 'V = ker 7,5+ , vert+ 'V s ker Td^ , 

and the forward vertical bundle by 

very, 'V = ker Tyd^ , vert" 'V = ker Td'' . 

The fibers of the forward [backward] vertical bundles are the tangent spaces to the forward [backward] fibers of the 
discrete tangent bundle. Item fl] of Definition IS] gives T'V - vert^'V © vert 'V so that every Sv e T^.^V decomposes 
uniquely as 6v - 6v^ + Sv^ where 6v^ e vert^'V and 6v^ e vertjl^'V. The signs may appear notationally reversed but 
they are mnemonic in the sense the one wants more often to apply Td^ not to elements of vert^'V, which would result 
in zero, but rather to elements of vert^'V. The convention 6v^ e vert^'V means that, usually, '+' goes with '+' to 
make something nonzero, while zero results when '+' goes with '-'. 

Remark 5. Lagrangian discretizations are towards constructing discrete Lagrangian models which have a stature with 
continuous Lagrangian models. However, not every construct that is well defined in the context of continuous models 
is also well defined in the context of discrete models. In the diagram above, at left, any point of A1 near the area 
straddling 5 (v) and d^{v) could be considered the base point of the discrete tangent vector v. The finite — as opposed 



vert;;T^ , 





to infinitesimal — nature of the discrete tangent vectors precludes an unambiguous association of configurations to 
elements of the discrete phase spaces. In the discrete context, the association of configurations to points of veloc- 
ity phase space is artificial — like the invocation of a metric or connection where none is really natural. This, of 
course, is somewhat unintuitive after such concentration on the continuous systems. The reflex to associate a unique 
configuration to a velocity has to be unlearned. 

Lemma 3. For all v e 'V, Tyd^ [resp. Tyd^ ] is a linear isomorphism from vert^'V [resp. vert^'Vy to T^-fy^M [resp. 
Ta^(v)M]. 

Proof. If 6v € vertj^'V and T^.d^iSv) = then 6v e kerT^^ n Td^, so 6v = by Definition [2] Thus T^d^ has trivial 
kernel on vert^'V and the result follows because dim vert^'V = dim A1 = dim T„,M. a 

In particular, if 6m e Ti„Ai and d^{v) - m [resp. d^(v) = m], then there is a unique 6v e vert^'V [resp. 6v e vertjI^'V] 
such that Td'*'(6v) = 6m [resp. Td^{6v) - 6m]. If v is understood, then we will write 6v = 5m^ [resp. 6v - 6m^] i.e. 
5m^ and 5m" satisfy 

Td'(6m^) = 0, Td^{6m^) = 6m, Td'(6m-) = 6m, Td^(6m-) = 0. 

Combining, if v, v e 'V are such that d^{v) - m - d^iy), then there are unique vectors 6v 6 vert^'V and 6v e vertt 'V 
such that 

Td^(6v) = 6m = Td-(6v). 

This provides a linear isomorphism Tf,,: vert^'V -^ vertt'V as follows: if 6v € vert^'V, then Tx.y{6v) = 6v is the 
unique vector in vertt 'V such that Tx,d^(6v) = T^,d^(6v). 

Suppose that CV, d^ , d^) is a discrete tangent bundle of M and is a smooth one form on 'V. Let v e 'V, and set 
m = d^{v) and m = d^(v). We will have use of two bilinear forms on T^Ai x Tff,Ai, denoted d^d(v) and d'^Oiy), and 
defined as follows. Given 6m e T„,M and 6m e Tfj,M, choose (local) vector fields X and X with values in vert^'V and 
vert^'V respectively, such that 

Td-{X{m)) = 6m, Td' X{x) = Td' X{y) if 5"(x) = d'iy), 

and also this with 6m, X, and '+' instead of 5m, X, and '-', respectively. Such vector fields X and X commute, because 

d 



Td-{X,X]{m) 



dt 



Td- iF^Xx(m) = - Td- x{F^(m)) = 



Define 



cre{v)(5m, 6m) = (X(<^6»))(y), d^6(v)(5m, 6m) = {X(ix0))(v). 



(10) 



This well defines (T'Oiv) : if X' and X' are other choices of such vector fields, then, using the identity i'[v,w]<^ 

{X{ix0)){v) = Lx'ixOiv) = ixLx'9{v) = ix^Lx'Oiv) = X'fe,0)(v), 
and similarly d^6{v) is well defined. Also, note that 

^^(^(v), X(v)) = X{ixe)(v) - X{ixe){v) - e{[X, X\){v) = cre{v){6m, 6m) - d^e(v)(6m, 6m), 
so d'^O = d'^d if and only if 6 is closed. 



3 Lagrange-d'Alembert principle 

3.1 Continuous Lagrange-d'Alembert principle 

Let (3 be a manifold of system configurations, and D c TQ be a submanifold such that tq\D is a submersion. To a 
given Lagrangian L : TQ — > R, the corresponding action functional assigns to curves q{t) e Q the number 



S = f L{q'{t))dt. 

Ja 

The variational derivative of S is 

d 



(11) 



dSiq(t))6q(t) = 



de 



e=0 



5(<7.(0) = 0, 



where 6q{t) is a curve in TQ, and ^^ (/) satisfies 

e=0 0£ 



qM 



q,(t) = 5^(f). 



Let £ be a subbundle of the pull-back bundle (tq|£))*(7'(3) i.e. a smooth assignment of subspaces of T^Q to each 
Vq e D. By definition, the curve q(t) is an evoJutioji if it satisfies LdA{L,D,S), defined as: 

LdA-1 (given constraint): q'(t) e D. 

LdA-2 (criticality): dS (q(t)) 6q(t) = for all 6q which satisfy 

LdA-2a (given constraint forces): 6q{t) € Sq'(ty, 

LdA-2b (fixed boundary): 6q(a) = and 6q{b) - 0. 

This is the general version, where the annihilator of £,, is the vector space of the constraint forces at state Vg, and 
where D is a nonlinear constraint on velocities. 

Since tq is assumed to be a submersion on D, ksi(T{TQ\D)) is a subbundle of vert TD with fiber dimension 
dim D - dim Q, and 



i),^ = {wgeTQ:veTt,^WgeTD}, 



(12) 
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is a subbundle of {TQ\Dy{TQ) with the same fiber dimension. One possibiHty for S is Chetaev's rule & = i), but other 
choices may be appropriate, as discussed for example in (14). Chetaev's rule specializes to the usual case of linear 
constraints if © is a distribution on TQ, because then £),,^ - Dq. 

The Lagrange-d'Alembert principle above is written for curves q{t) e Q. We now transform it a variational 
principle for curves v{t) e TQ, by placing q{t) and v(t) in one-to-one correspondence using q(t) - tq v{t) and v{t) - 
q'(t). The transformed variational principle has the additional constraint (the first order constraint) v(t) = (tq v(t))' on 
curves v{t) e TQ. Substituting q'(t) = v(t) into Equation ( [TT] i transforms the action to 



Ja 



L{v(t)) dt, 



and we extend S to all curves v(t) by this same formula. The variational derivative of S is 
d 



dS{v(t))6v(t) = 



de 



6=0 



SivM), 



where 6v(t) is a curve in TTQ, and v^(t) satisfies 

d_ 
5^ 



vjt) 



v(t). 



£=() 



v,(t) = dv(t). 



6=0 



The variation Ve(0 implies a variation q^{t) - tq v^it); dilTerentiating this in e gives 6q(t) = Ttq 6v{t), and thus the 
constraint 6q(t) e £,.(,). The first order constraint gives Vf(f) - {tq v^(t))' and differentiating in e gives the constraint 
on 6v(t) corresponding to the first order constraint: 



6v{t) = 



e=o oe 



d d 

■^TQ v^(f) = SQ —Ttq 6v(t) = SQ Sq{t) , 

;=() Ot dt 



where sq is the canonical involution on TTQ. Thus, the Lagrange-d'Alembert principle transforms to LdA'(L, £), £): 

LdA'-l (constraints): 

LdA'-la (given constraint): v{t) e £); 

LdA'-lb (first order constraint): v(t) = {tq v(f))'. 
LdA'-2 (criticahty): dS {v{t)) 6v(t) - for all ^v which satisfy 

LdA'-2a (given constraint forces): 6q{t) G &v(t), where 6q = Ttq 6v(ty, 

LdA'-2b (fixed boundary): dq(a) = and 6q(b) = 0; 

LdA'-2c (first order constraint): Sv = sq 6q' . 

This transformed principle LdA' has some technical advantages and is better suited to construct discretizations where 
the discrete states are elements of TQ. It has been used in ( fTSl l. where the higher dimension of TQ as opposed to Q 
provides some required freedom in a desingularization of continuous Lagrangian systems at time interval zero. 

Remark 6. Implementing the first order constraint as a Lagrange multiplier gives 



4 

Ja 



L{}it)) + {p{t),q'(t)-v(t))dt 



(13) 



where naturally p{t) e T*Q. This is the Hamilton-Pontryagin principle ([26l |27] ). The HP principle variationally 
identifies the Legendre transform as the Lagrange multiplier of the first order constraint i.e. it implies the constraint 
p = FL. See Section[33]for a few more comments on discretizations of the HP principle. 
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3.2 Discrete Lagrange-d'Alembert principle 

We will develop a variational principle for sequences of points in a discrete tangent bundle 'V, analogously with the 



continuous LdA' principle of Section 3. 1 which is a variational principle for curves with values in TQ. 

Let (3 be a configuration manifold and CV, d^, 5 ) a discrete tangent bundle on Q. The underlying structure for the 
discrete variational principle, which we call the Lagrange-d'Alembert principle is as follows: 

• Given a (discrete) Lagrangian L^ : 'V — > R, the corresponding discrete action functional assigns to sequences 
Vd{k) e'V,k- 1 , . . . , A^, the number 



Sd,N(vd) = 2_] Ld{vd(k)). 



k=\ 

We have reserved the subscript d to distinguish the discrete and continuous contexts. The derivative ofSd is 
d 



dSd.N{vd) Svd = , 
de 



Sd.Nb'd,e) 
e=0 



where 6vd e T^[\,N], and Vd,e ^^[l,N] satisfies 

d 



Vd,u{k) = Vd(k), 

oe 



Vd,e{k) = 6vd(k). 

e=0 



The discrete velocity constraint is provided by a submanifold Dd of 'V, such that d^\Dd and d \Dd are submer- 
sions. 

In the continuous context, the constraint forces are determined by an association & of subspaces of TQ to 
velocities in the continuous constraint D. For the discrete context we assume a subbundle &d of the pullback 
bundle (5^|£)rf)*(T'<3) i.e. &d is a smooth assignment of subspaces of Tg+^y)Q to v e Dd- 

Higher order of accuracy of discretizations of (continuous) nonholonomically constrained Lagrangian systems 
require discrete analogues that do not fit the pattern just described. To accommodate this, we generalize and 
replace dLd with a one form od on T^ and replace the derivative of the action dS^ with I,d,N where 



^d,N(^d)6vd = 2_j^divd(k))6vd(k). 



k=l 

The net eff'ect is that the discrete analogue od of the derivative of the Lagrangian is not necessarily closed, and 
it contributes to the discrete analogue of the derivative of the action, which also is not necessarily closed. 

A discrete constrained Lagrangian system (DCLS) is a tuple (^,crd,Dd,Sd) as above, where 'V = CV, 5^,5 ) is a 
discrete tangent bundle. If the constraint is absent, then the tuple CV, (Td) is simply a discrete Lagrangian system 
(DLS). By definition, a sequence v(k) is an evolution if it satisfies 

LdAj-1 (constraints): 

LdA^-la (given constraint): v(k) e Dd', 

LdAj-lb (first order constraint): d^(vd(k)) - d'{vd(k + 1)). 
LdA^-2 (criticality): '^dM^dik)) Sv{k) = for all Sv(k) which satisfy 

LdAj-2a (given constraint forces): Td^{6vd(k)) e (Sd)vj{kh 

LdA;, -2b (fixed boundary): Td-{6v(l)) = and Td^{6v(N)) = 0; 

LdA;, -2c (first order constraint): Td^(6vd{k)) = Td-(6vd(k + 1)). 
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This is the discrete Lagrange-d'Alembert principle LdA^^V, Od, Dd,Sd)- 

Definition 4. 

Cd,N = [we'V[l,N]: d'-iwik)) = d-(w(k + 1))}; 
Nd.N = {we Cd,N ■ w(k) e Dd]; 

^Vd,N = {6we T,,Cd,N : rr((5w(l)) = 0, Td^i6w{N)) = 0, Td^i6w{k)) e (6^),,}. 
Altogether, a sequence v^ satisfies LdA^ if and only if 

1. v^ lies in the manifold Crf A?; and 

2. Vd satisfies the constraint v^ e Nd^N', and 

3. v^ is critical, meaning ^d,Nivd) 6vd - for all 6vd e 'Wd,N- 

The restriction to the first order submanifold Cd,N is implemented first. In part this is because the distribution ^d,N has 
no natural extension away from Cd.N^ since it is the common value of the backward and forward projected variations 
that have to be in £. 

Remark 7. LdA^ is potentially a skew critical problem, in that one seeks points in a constraint — Nd,N — where a one 
form — Erf AT — annihilates a distribution — ^d,N — but that distribution is not necessarily the tangent bundle of the 
constraint. See (8) and Section]?] 

As shown in Theorem HI below, the discrete Lagrange-d'Alembert principle for sequences of arbitrary length is 
equivalent to the same principle for consecutive pairs of the sequence. This is critical to the construction of integrators, 
because it reduces an optimization on length A^ sequences to an iteration on length N - 2 sequences. Thus, the N - 2 
case occurs often, and it is helpful to abbreviate its notations. 

Definition 5. 

I.d{v, V) = Odiv) + (Td(v), 

Cd = {(v,v)e^x^: d^{v) = r (v)), 

Nd = {{v,v)(^Cd:v,veDd], 

"Wd = {(6v,6v) e TCd : Td-(6v) = 0, Td^(6v) = 0, Td^(6v) e (6^),}. 

i.e. the atomic N - 2 case is abbreviated by by dropping the N subscript, (v, v) is a solution pair if it satisfies LdA'for 

A? = 2. 

Tfieorem 4. Let {'V,(rd,Dd,£>d) be a DCLS. Then a sequence Vd(k) is a discrete evolution if and only if each pair 
(v, v) = {vd(k), Vd(k + 1)) is a discrete evolution, 1 < k < N — I. 

Proof. Obviously, vj e Nd,N if and only if every pair (vdik), Vd{k + l))e Nd- So it is only necessary to show that, 
for Vd e Nd,N, '^d,Nivd) annihilates every 6vd e (y^d,N)vj if and only if Erf(v(fe), v(k + 1)) annihilates every {5v, 6v) e 
r^d\v{k)Ak+\))A<k<N-l. 

A sequence 6vd e ^d.N has 5vd(lY - and 6vd(Ny = by definition, and so is the sum of the sequences which 
are the rows of the following array: 

Svdiiy 5vd{2)- 

6vd{2r svdor 

SvdOY 6vd{A)- ■■■ 

■ ■ ■ 5vd{N - ly SvdiNY 
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As is easily verified, this corresponds to the direct sum decomposition 

A'-l 

'Wa,N = 0(nVd)(Ki).K-t+i)) (14) 

k=\ 

where elements of the subspaces iy^ d){v(k),\ik+\)) are understood to be, after appropriate padding with zeros, sequences 
of length A^. Thus E^at vanishes on ^d,N if and only if E,/jv vanishes on each factor {^ d)(\ik),v{k+i))- But, for 5vd e 

'Wd,N, 

N 

^d,N(vd) 6vd = 2_i '^divd(k)) Svd{k) 

k=\ 

N 

= 2 (rd{vd{k)){6vd{k)- + dvdikf) 

N-\ 

= (Td{vd(l))5vd(l)- + cTd{vd{N))5vd{NT + Y^Tdivdik + l))6vd{k + ly + (Td{}'d(k))Svd{kT) 

k=\ 
N 

= Y^ -Ldivdik), Vd(k + mSvdikr, 6vd(k + 1)-), 



i=l 



i.e 



with respect to the decomposition ([14]), Sjat - ®^^i ^d- Q 

Remark 8. Many fundamental physical systems have continuous variational formulations with a fixed boundary con- 
straint, and with action defined as an integral of a local Lagrangian. The solutions of such variational formulations have 
the essential property of localization: restrictions of solutions are solutions. This follows directly from the variational 
principle. Indeed, the action of a solution is a sum of the action over the restriction of a solution and the complement 
of that, and a fixed boundary variation of such a restriction is a variation of the whole. So the restriction is critical 
under such variations, because under them the whole is critical and the action is constant on the complement of the 
restriction. The proof of Theorem]?] which is also purely variational, shows that the discrete skew critical problem, 
for arbitrarily long sequences, is equivalent to successive skew critical problems, for sequences of length 2. This is 
because the discrete action is a sum over oj, and because of the fixed boundary constraint. Thus the discrete systems 
have localization to the discretization scales for the same reasons that the continuous systems have localization to 
arbitrary scales. 

3.3 The discrete Hamilton-Pontryagin principle 

One approach to discretizations of Lagrangian systems is through discretizations of the Hamilton-Pontryagin princi- 
ple ( [T3] l as in (]6l [T3] ). The Hamilton-Pontryagin principle does not immediately discretize in the formalism of this 
article, because 

1. it requires the difference q'(t) - v(t), but discrete tangent bundles do not support linear operations; and 

2. it requires q{t) from v(f), whereas there is no unique projection to configurations from discrete tangent bundles. 

To recover the HP principle in our context, one might posit additional constructs sufficient to intrinsically write the 
principle itself. For example, a discrete analogue of the difference q'{t) - v(t) could be constructed using an appropriate 
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submersion A: 'V X'V ^ TQ. We choose not to pursue this here, but rather note that one can an apply Lagrange 
multipHers to the variations, after differentiating the action and after imposing the second order constraint in phase 
space. That is, the discrete HP principle is obtained by removing LdA^-2c and replacing LdAJj-2 with 

'^dA^dik)) 6v(k) + {pk, Td-{5vd{k + 1)) - Td\6vd(k))} = 0. 

The difference is valid because it occurs in the single tangent fiber of TQ at (9^(v^(fc)) - d^{yd{k + 1)). Reverting to 

N -2 gives 

o-j(v) 5v + crd(v) 6v + {p, Td-{5v)) - {p, Td^{5v)) = 0, 
or, putting separately 6v - 6q^, 5v = and then 6v - 0, 6v = Sq^, 

{p,5q) = {crd{v),5q^), {p,6q) = -{crd{\^,5q'). 

This identifies the discrete Legendre transforms 

<F+L(v), 5q) = {(tAv), 5q*), {F-L{v\ 6q) = -{ctAv), Sq'), 

and we have the commutative diagrams 






Q 




<V -^^ T'Q 



Q 




4 Equations of motion 

4.1 Continuous equations of motion 

In the continuous context, localization as explained in Remark [8] lends to the expectation of differential equations of 
motion (fTSt . Defining the second order submanifold 

Q = {q"(0) : q(t) a C^ curve in Q], 

there is a unique section dL of hom(7'(3, T*Q) and a unique section 6L: Q ^> T*Q of the bundle {ttq\Q)*{T*Q), such 
that 



dS (q(t)) 6q(t) = f 6L{q"{t)) 6q(t) dt + dL(q'(t)) 6q 

J a 

Defining the one form Oi{Vq)w^.^ = dL(Vg) Ttq(w^.^), 

dS (q(t)) 6q(t) = f 6L{q"{t)) 6q(t) dt + Oiiqit)) 6v 

Ja 



(15) 



where 



6q(t) = 



e=0 



^.(0, 



6v(t) = 



de 



d_ 

e=odt 



r=0 



qM- 
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So the variational principle identifies 6L and 6i directly, and a curve q'{i) is an evolution if and only it satisfies 

q'it) e D, 6L{q'{t)) = -A{t) e ann£. (16) 

These are the (continuous) Lagrange-d'Alembert equations for curves in Q. In coordinates, Q = {{q', q', q',q')}, and 
_/dL _ d_dL\ i _(dL _ d^L . ,. d^L ..A ,. 

so that a curve q'(t) is an evolution if and only it satisfies the familiar 

dq^ ^ dL d dL 

-j- eD, —--——-= -Ai(t) e ann£. 

dt oq' dt oq' 

All this corresponds to the principle LdA i.e. for curves on Q. To cast it to the form of LdA' i.e. for curves on TQ, 
one must assume that v(f) is first order, or else the integration-by-parts inherent in Equation ([TSj will fail. Under that 
restriction the various formulae transform easily, and Equations ( fTSj l become 

v(f) = (tq y(f))', v(t) e £), 6L{v{t)) = -A{t) e annfi. 

These are the (continuous) Lagrange-d'Alembert equations for curves in TQ. 

The first and second fiber derivatives ([U of L are the maps FL : TQ — > T*Q and F^L : TQ — > T^Q defined by 

FL(Vg)Wg = D(L\TgQ)(Vg)Wg, F^L(V,)(Wg, Wg) = D^ {L\T qQ){V q){W q . Wq). 

A Lagrangian L is called {D,S)-regular if, for all Vg e D, the condition: Wg e i),,^^ (recall Equation ( [T2] l) and 
F^L(v^)(w^, vv^) = for all vv^y e fi,,^ implies w^ = 0. If L is (£), fi)-regular, then the fiber dimension of £) and the fiber 
dimension of fi are necessarily equal i.e. regularity implies 

dim D - dim Q - fdim fi 

or equivalently 

2 dim Q - dim £) = dim Q - fdim fi. 

The number of constraints in LdA (or its equivalent LdA') is the codimension of D in TQ, while the dimension of the 
space of constraint forces is the fiber dimension of the annihilator subbundle of fi in (TQYiTQ) i.e. there are 

2 dim Q - dim £), dim Q - fdim fi 

independent constraints and constraint forces, respectively. At the outset of LdA, D and fi are hypothesized and 
independent, and the number of independent constraints is unrelated to the number of independent constraint forces. 
Given regularity, equality of these is assured, and we set 

r = dim D - dim Q = fdim fi, 

and also there is (TO') a unique second order vector field Yg^ on D such that SL o Ygi annihilates fi. Existence and 
uniqueness for LdA on the phase space D follows because its evolutions are the integral curves of Y^l- 
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4.2 Discrete equations of motion 

To develop discrete equations of motion, we make the following definitions: 

1. Qd = {(v, v) : d^(v) = 6^(1')}, which is a submanifold ofVx^. This is set theoretically the same as Cd, however 
for Qd we consider 'V x 'V to be a discrete tangent bundle of 'V, whereas for Cd we consider "V x "V to be the 
atomic two-point evolutions in 'V. 

2. dodiv, v) Sq = Od{v) 6q^ + Od{v) 6q^, which is a section of the pullback bundle ttJ TQ, where Ud'. Q ^ 2 by 
7rrf(v,v) = 5» = 5-(y). 

3. 0Orf(^) ^^ = -^d(v) 6v^, ^i(^) <5v = (Tdiy) 6v^, which are both one forms on 'V. 

4. w^_, = -rf0^, and<=rf0j_^. 

Remark 9. 0t - 0Z. - cTd so a>t = «,:_ if od is closed, and in this case we write Wq-, for either 

'-'ill "-'(f ^d ^d " 

From the proof of Theorem HI the discrete analogue of Equation ([TSj is Equation ^VT) below. This is a critical 
equation for our development and so it is separated here as a theorem. 

Theorem 5. If{'V, a-d, Dd, &d) is a DCLS then 

N-\ 

L*c^^YdAvd) 5vd = Yj ^"^din, Vk+i)6q{k) + 0+ (v^) 6v{N) - 0^/vi) 5v(l) (17) 

where icjn'- Cd,N — > 'T^[1,A^] is the inclusion and Sq = Td^{6vd(k)) — Td^{6vd(k + 1)). 
Thus, a sequence v(k) is an evolution if and only if it consists of pairs (v, v) which satisfy 

5+(v) = r(v), V, V e Dd, ScTdiv, v) e ann&d- (18) 

These are the discrete Lagrange-d'Alembert equations. 

There is no general existence and uniqueness result for the nonlinear algebraic Equations ( fTS] !. However, local 
existence and uniqueness can be analyzed at the level of linearizations, using the inverse function theorem: View ( fTSj ) 
as equations for v e Dd given fixed v e Dd, and denote q = d^(v). Smoothly choose (local) vector fields Xgg on 
Q, linearly parametrized by elements of 6q e {Sd)v, such that Xgg{q) = 6q i.e. the vector fields Xsg are extensions of 
6q e iSd)v Each Xgg lifts to a vector field XZ taking values in vert^'V and such that TcJ" X. = X. d^ . Equations ( [TSl ) 
may be written 

V e {d-\Dd)-'{q), (Td(v)X-^{v) - -(Td{v)6q, 5q € {&d)v 

The Unearization of these in v is the derivative with respect to v, in direction dv € Tf{{d^ \Ddy^ (q)), of the left side of 
the second equation. Such <5v are obtained one-to-one as Sq^ from 6q € Td"^ (weitz 'V nT^Dd). So, from Equations ( [TO] l, 
the required condition is that the bilinear form 

(6q, 6q) ^ d^Od(v)(6q, 6q), Sq e (fi^),,, Sq e r5+(vertr'y n Tj.Dd) (19) 

is nonsingular 

In continuous Lagrangian mechanics, the term 'regular' refers to linear conditions that provide proper equations 
of motion (hyperregular is the global condition that the Legendre transform is a diffeomorphism), and there is a single 
notion of regular, which is equivalent to nondegeneracy of the Lagrange two-form. But linear conditions are of the 
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infinitesimal, and they do not migrate well to the discrete context, which is finite. The discrete tangent bundle does 
not intrinsically support linear operations. So it is not that surprising to find a variety of notions of regularity in the 
discrete context, and we collect some of these here. There are more possibilities than the below: for example, more 
can be generated by replacing d^ with d^ . 

Definition 6. Let (y, era, Dj, &d) be a DCLS. 

1. (Jd is regular" if, for all (v, v) e Q such that v,v e Dd, the conditions (1) d'^OdiyYdq, 6q) - for all 6q e {&d)v, 
and (2) 6q € r5"(vert+'y n T^.D^,), imply 5q = 0; 

2. (Jd is regular^ if, for all (v, v) e (3 such that v, v e Dd, Ihe conditions (1) d'^<Jd{v){6q, 6q) = for all 6q e {&d)v, 
and (2) 5q 6 r5+(vertp 'V n T^Dd), imply 6q = 0; 

3. (Td is regular if it is regular^ and regular^ ; 

4. (Td is {V, y)-regular, V,V Q 2)d open, if it is regular, and, for all v e V there is a unique v e V such that (v, v) 
satisfies the discrete Lagrange-d'Alembert Equations ([T8|. 

The fiber dimension of ^^^(vert 'V n TD^) is dimDd - dim (3 and so, if cTd is regular^, then necessarily 

dim Dd - dim <3 = fdim &d, (20) 

and, as in the continuous context, we denote the common value by r. Thus regularity implies dimensional equality of 
the constraints and constraint forces, just as in the continuous context. 

A discrete Lagrangian vector field is a map Ya-/. t/ — > (2, where U Q'V is open, such that d^^^ ^os(^) - ^ ^^d 

<^o"rf Ya-j{v) 6q - for all 6q e (£j),,. 
By TheoremH] Vd is a discrete evolution if 

{vd(k),Vd(k + D) ^ Y^^{vd(k)), 

which says that the discrete derivative of the sequence Vd at k is the discrete Lagrange vector field at Vd{k). Discrete 
evolutions can be obtained from a discrete Lagrangian vector field by iterations of maps F defined by Yaj{v) = (v, F{v)). 

5 Structures of discrete Lagrangian systems 

Beginning with (fTsT l. and continuing with ( fT9l l. there is an effective procedure for the recognition of structure for 
variational theories, specifically symplecticity, momentum preservation, and the equations of motion. In summary, 
this procedure uses the decomposition of the action into boundary and nonboundary parts, such as Equations ( [T5| ) 
and ^Vf) . This decomposition is pulled back by the inclusion i which maps solutions into the domain of the action 
functional. In the context of a Lagrangian L and action S,, the procedure consists of the following steps: 

Momentum structure: write i^(t*dSt) — 0; 

Symplectic structure: write d{L*(dS,)) — 0; 



Symplectic equations structure: note L — — 



d 

I S,, write dL - — 

1=0 dt 



L*dS, and use Lxa - dixa + ixda. 



In this section we apply this procedure to extract the discrete structure preservation properties of a DCLS. From 
the remaining of Sectionpjthrough SectionWi let ("V, 5",5^,crj, £)rf,£j) be a given DCLS with be an evolution map 
F: Uf -^ Vf. 
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5.1 Decomposition 

The evolution map F defines an insertion lf of Uf into solutions of the DCLS by Lf(v) = (v, F(v)). Pulling back 2^ by 
if gives, from Equation ^TT) , 

42/ v) 6v = 6a;{v, F(v)) Td^dv) + et,iF(v)) TF(6v) - Q-Jv) 5v 
i.e. 

i*pl.d^F*0l^-e-,^+aF (21) 

where Of is the one form defined by 

apiv) 6v = dodiv, F{v)) Td^{5v). (22) 

An important fact is that 

apiv) 5v = for all 5v e 7,2),, such that Td^(5v) e i&dX, (23) 

because (v, F{v)) is a solution pair, and because of Equations ( [T8] l and ( |22] l. 

5.2 Momentum 

We begin with an definition of a symmetric DCLS. Let ^ be a Lie group with Lie algebra g. 

Definition 7. An action of a group ^ on the DCLS CV, 5,5^, o;;, £)^, £d), where d:'V-^Q and d^.'V^Q, means 
actions ofQ on Q and 'V such that 

1 . d^ and 5^ are intertwining; and 

2. (Td, Dd, &d are invariant; and 

3. crd(v)(^v) = for aJJ ^ e g and all v e Dd- 

In a symmetric DCLS, the derived constructs Cd,N, ^d.N, ^d,N, ^d are all invariant under diagonal actions of 0, 
and the symmetry group preserves the solutions. For the remainder Section [372] we posit the action of a group as in 
Definition^ 

Equivariance of evolutions requires uniqueness and hence has to rely on regularity. However, infinitesimal equiv- 
ariance can be recovered with only an infinitesimal flavor of regularity. 

Tlieorem 6. 

1. IfcTd is regular'^ then TF(^v) — F{^v) for all v in the domain of F and all ^ e g. 

2. Suppose cTd is (Uf, Vf)-regular^ . Then TF(gv) — gF(v) for all v e U f and g & Q such that gv e Vf. 



Proof. From discussion following Equation ( [181 ), CTd regular^ implies local existence and uniqueness of the evolutions 
via the inverse function theorem. Thus, for small f, Q invariance of the evolutions implies F{e.xp{^t)v) - exp(^f)^ and 
the first statement is obtained by differentiation at f = 0. The second statement follows from the global existence and 
uniqueness of (Uf, yF)-regularity. n 
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Generally, momentum is defined by insertion of the infinitesimal generator into the analogue of the Lagrange one 
form. There are two momenta since the DCLS context includes two Lagrange one forms 0J^ and 0^^. 

Definition 8. The momentum maps are the two functions 7^ : 'V — > g* and 7 : 'V — > g* defined by 

j^(v) = {j-(v), o = e-^p){^v) = -o-d(v)(^v)-, r^iv) = <y+(v), = <(v)(^v). 

Group invariance provides that both the momentum maps intertwine the action on 'V and the coadjoint action of g* 
i.e. the momenta are CoAd-equivariant. Furthermore, if v e Dd then J^iy) - J^iy) and the superscript on J may be 
dropped, because 

= (a-rf(v),^y) = <o-rf(v), {^vT) + {(Ta(v\ (^v)+> = -7^ (v) + J^iv). 

The momentum conservation structure for a DCLS is as follows: 

Tlieorem 7. If CTd is regular"^ , v € T)d, and F is an evolution map, then J^{F(v)) — J^(v) for any ^ € g such that 
^5+(v) €(£),.. 

Remark 10. The momenta J^ are of course not generally conserved in continuous nonholonomic mechanics. Theo- 
remlTldoes not imply conservation of arbitrary momenta for a DCLS, because if v,/ e 'y[l, A^] is an evolution then it 
is not usually possible to arrange the condition ^d'^{vd(k)) e (£>)vj(k) for constant ^ independent of k. Rather, one will 
have a map 'V 3 v — > ^^{v) which satisfies ^^{v) d^(v) e (£),,, and then 

y^(f(,„))(F(v)) - 7f.(„)(v) = %K(f(.))(v) - 7fV(,)(v) = <y(v),^'^(F(y)) - ^'^(v)), 

which is called the discrete nonholonomic momentum equation (|7]l. 

P roof of Theorem^ Insertion of the infinitesimal generator ^©^ into Equation pT| ) gives, because od annihilates in- 
finitesimal generators, 

= {et,iF(v)),^Fiv)) - <0-/v),^v> = J^iFiv)) - J^(v). 

There is no contribution from Of because of the Equation (|23]|. n 



5.3 Symplectic 

In the continuous nonholonomic systems LdA with constraint D and variations £, the distribution 

"7Co,£ = {6ve T,D : TraiSv) e £,} 

is an important object because it supports the associated nonholonomic semi-symplectic structure (l3ll9l [T9ll22l l. For a 
DCLS we explore three possible discrete analogues ofKos: 

'^D,£„ = {^'' 6 ^-^^ ■ Td-{6v) e (&d)3-(..)], 

K,,a„ = l'5v' 6 r,,^ : Td-{6v) e (8d)a-(vh Td^iM e (&d)dHv)}, (24) 



Under a the dimension condition ( pO| i, which is implied by regularity, all three of these discrete analogues have the 
same fiber dimension as the continuous "TCog: 
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Lemma 8. Ifr = dim£)j - dim (3 = fdimS^ then 

Proof. By definition, 5^|£)rf is a submersion, so 

dim Q + r — fdim "TC^ g = dim Q - fdim £, 

so fdimTCi c- - 2r follows and fdimTC^ o = 2r is similar. The fibers of (Td^Y^&d and (Td^y^Sd are transversal 
subspaces of the fibers of T'V, since 

vert+'V = ker Td^ c (Td^y^Sd, vert^'V = ker Td' c (Td-y^6d, 

and vert^ V © vert 'V = r'V. The codimension of the intersection of transversal subspaces is the sum of the codimen- 
sions, so 

cofdimTC^^g^ = cofdim(r5+)"'firf + cofdim(r5")"'firf = 2(dim<3- r) 

and the result follows because 

cofdimTC^^g^ = dim'V - fdimTC^^^^ = 2 dim (3 - fdimTC^^g^. 

D 

The distribution TC^ g is the kernel of the canonical vector bundle mapping v: T'V -^ T'V /'K^ ^ and admits ([T9l 
a curvature two form Av* on T'V such that, for all vector fields X,Y e TCi o , 

"^Oj.Sj ' Vd,6d' 

A<K* (X,Y) = -v[X,Y]. 
Clearly "7CJ ^ is involutive if and only if A-x* - 0. 
Theorem 9. Lef TC" be a subbundle ofK^ g over £)" c £)^. Suppose that 
1. 'K^ is TF invariant; 

3. d(Td = 0on 'K^J. 
Then F preserves Wo;, = wj^ — oj^^ on "K^. 
Proof. If Sv, Sw e ("TCj),, then, remembering the inclusion ^^(v) = (v, F(v)), 

d{i'pl.d)(5v,5w) = L'pdl.d{6v,5w) = rforf(v)(5v, 5w) + dcrd{F(v)){TF{5v),TF{6w)) = 0, 

because of Items fll and pi In the same way, o^/ = 0^^ - Sq.^ and dcTd = on TC^' imply wj^ — -dQ%^^ - d0^^ - w^^ on TCj. 
Thus, from the exterior aerivative of Equation ( |2T| |, 

on 7C^, so it is sufficient to show daf = on TC^. Extending 6v and (5w to vector fields V G TC^ and W G "TC^*, 

rfa/r(5v,(Jw) = y(af(W))(v) - WCa^CV^Cv) - apiW, W])(v), 
and the result follows from Equation ( |23] l because Iteml2|implies [V, W](v) G TC^ g . n 
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6 Discrete linear and holonomic constraints 

The continuous constrained Lagrangian systems LdA commonly have linear constraints, where the constraint is a 
distribution on Q, and the same distribution also provides the variations i.e. the special case where £) is a distribution 
and S = (tq\D)*D is the usual one. In this section we construct discrete analogues of this special case. 

Recall (l23l l that if m, m e A1, and 'F is a distribution on M, then m is 9^ -reachable from m if there is a piecewise 
smooth curve c: [a, fe] — > (3 such that c'(t) e ;F and c(a) - m, c{b) - m. ;F-reachability is an equivalence relation on 
M and the equivalence classes are called the orbits ofT. 

Definition 9. The DCLS ("V, d^,d^, Od, Dd, £>d) has linear constraints if there is a distribution D^ on Q such that 

1. (Sd)v = (Df)imv) for all v € Dd; and 

2. d^(v) and i9^(v) are Df -reachable for all v e Dd', and 

3. dim Dd = dimD^. 

CV, d^,d^, Od, Dd, &d) is holonomic ifDj is involutive and d<Td - 0. 

To compare this definition with the continuous context, the first condition corresponds to using the distribution D'^ 
of Q for the variations &. The second condition is fulfilled, for example, in the case where the tangent bundle 'V 
arises from a discretization which curve segments are integral curves of Df. Such curves can be regarded as discrete 
analogues of the elements of D'j, so the second condition corresponds to equating D'j with the velocity constraint. 
Thus a DCLS with linear constraints is a discrete analogue of a (continuous) LdA with linear constraints, in that a 
single distribution on configuration space generates both the velocity constraint and the variations. 

Remark IL In the context of Definition 9 we will set r = fdim£j = fdim£)^. In particular, dim£)j = dim (3 + r. 

For the remainder of Section\6\ let the DCLS CV, d^ , d^ , Ld, oit, &d) have linear constraint distribution D^. If D^ 
is integrable, then the condition that d^(v) and d^{v) are ©^-reachable is strong, because it confines these to be in the 

is a first step in this line of reasoning. 
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same r-dimensional leaf of £)f . Lemma 

Lemma 10. Suppose DR is involutive and let q e Q. Then d^ and d^ are local diffeomorphisms from (respectively) 
% n Dd and 'V^ n Dd to the leaf of D^ through q. 

Proof Let £.q be the leaf of D'j containing q e Q. By Item 2 of Definition 9 d^ immerses the r-dimensional 
submanifold ^^ n Dd into X.q, which also has dimension r. Thus d^ is a local diffeomorphism from ^^ n Dd into 
X.q- Similarly, d^ a local diffeomorphism from ^^ n Dd into X.q- □ 

Lemma 11. IfDj is involutive then 

1. (Td is regular'^ if and only if for all v e Dd, the conditions (1) d^crd{v){Sq, 6q) = for all 6q € {D^)g+{y), 
and (2) Sq € {Dj)d-(y), imply 6q - 0. 

2. (Td is regular^ if and only if for all v e Dd, the conditions (1) d^crd{v){Sq,Sq) — Q for all 6q e (D^)g-(v), 
and (2) Sq e iDf)g*(v), imply 6q - 0. 

Proof If V, V e Dd and 5+(v) = Sr(v), then (6d\ = (Df)a-(y)- LemmajTTJgives r<9+(vertf:'y n TfDd) = (D^)a*(f), and 
the result for regular^ is obtained by transcribing Definition |6J regular" is similar n 
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A skew critical problem in the meaning of Remark IT] is ordinary [resp. variational] if the tangent bundle of the 
constraint is the equal to [resp. contains] the distribution used to differentiate the objective. Ordinary critical problems 
correspond to the standard constrained optimization problem that seeks critical points of an objective subject to a 
constraint. Continuous systems with linear constraints are are variational exactly if the constraint distribution D is 
integrable i.e. exactly if the system is holonomic in the usual meaning of the term ([19). In the discrete context there is 
Theorem[T2]below. 

Theorem 12. The following are equivalent: 

1. Dy is involutive; 

2. "Wd.N is involutive; 

3. 'Wd,N = [6vd € TNd,N ■■ 5q-{l) = 0, 5q^(N) = 0}. 
Proof. Assume N -2; the proof for arbitrary A^ is similar. 
(1)=>(3). Let (v, v) e Nd, and define 

q- = d-iy), q = d^iy) = ^ (v), q^ = d^iy), 

which are all in the same leaf of D^. Temporarily define 

{Nd){v,v) = {(6v, 6v) e I^.-roCV x 'V) : Td-(6v) = 0, Td^{6v) = 0, Td^{6v) = Td-{6v), 6v e TDd, 6v e TDd). 
corresponding to the set on the right side of the equality in Item[3] Recall that 

CWd)(v,v) = {(6v,6v) e T^„,y)Cd ■ Td-(6v) = 0, Td^idv) = 0, Td^idv) = Td-i6v) e Df]. 

It is required to show that 'Vl^(i.,r.) = ^(v,v) i-e- the condition 6v, Sv e TDd in the definition of (y^d)(v,v) amounts to the 
same thing as the condition Td*{6v) - Td^iSv) e T)^ in the definition of N(v,v)- ^^ (^'^'' ^^) ^ M'-r) '■hsn Sv is tangent 
to "V^ n Dd and T^d^ maps the tangent space of this at v (isomorphically) to {T>f)q- Thus Td^{6v) e D^. On the other 
hand, if (6v,6v) e CWdXv.v), find the unique 6v' e vert^'VnDd and 6v' e vert+'VnDrf such that Td^{5v') = Td~{6v'). 
This implies Sv - Sv' since T^d"^ is a local diffeomorphism from vert^'V to TqQ. Thus 6v € TyDd since Sv' is. 
Similarly, Sv e TyDd, so {Sv,Sv) e N(v,v)- 

(2)^(1). Let X and Y be vector fields with values in D^ and let q e Q. Arrange v, v e Nd so that cJ^(v) - d^(v) - q. 
Define nd : Nd — > Q by 7Td{v, v) = d^(v) - d^(v). X and Y have have unique lifts X and Y which are vector fields on 
Nd with values in "Wd such that Tnd X -X ojid and Tud Y - Y oud- Then [X, F] o ttj = Tud {X, ?] and by assumption 
[X, T\ has values in W^, so 

{X,Y](q)^Tnd[X,Y]{v,v)&^, 

as required. 

(3)=>(2). By assumption, 'Wrf is integrable because it is equal to the kernel of the derivative of the map from Nd to 
QxQ defined by Vd i-> (d^iv), d^{v)), and any such kernel is involutive. n 

There are the following speciaUzations of the distributions ( |24] i to the context with linear constraints: 

'K^^ = TDdn(Td-r'l^, 

TC",^ = {(Td-r'nf n {TdT's)f)\j,^, 

'K^^ sTDdn(Td^r^D^. 
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The fiber dimensions of these are all 2r because the dimension condition of Lemma [8] is satisfied, as explained in 
Remark 11 Lemma 13 below shows simplifications if £)^ is involutive, and it provides a discrete analogue of the 
continuous fact that 'Kd'xs integrable if and only if D is integrable. 

Lemma 13. 

1. The following are equivalent: (lA) G^ is involutive, (IB) 'K^ is involutive, (IC) "JCL is involutive, (ID) TC^ is 
involutive. 

2. The following are equivalent: (2A) 'K^^ c TD^, (2B) 'K^^ = 'K^^^, (2C) TC^^ = 7C+_^. 

Moreover, the statements in (1) imply the statements in (2). 

Proof. 

(1A)=>(1B), (1A)=>(1C), (1A)=>(1D): pull backs and intersections of involutive distributions are involutive. 
(1A)<^(1B), (1A)<^(1C), (1A)<^(1D): IfK^^ is involutive and X,Y are vector fields on Q with X,Y e TJ^, then 
Z+,F+ e "TC^^, hence [X^,Y^] o 5+ = Td^[X^,Y^] e Df. Thus E^ is involutive. (1A)<^(1B) and (1A)<^(1D) are 



similar after using Lemma 10 to lift X and Y. 

(2A)o(2B) and (2A)o(2C): If TC^^ c TDd then 7C^^ c TDj n {Td-y^D^ = 7C^_^ and hence -TC^^ = 'K^^ by 

Lemmajs] If TC^^ = 7C^^ then 7C^^ c TC"^ c TDd- Similarly (2A)o(2C). 

(1A)^(2A): Suppose D^ is involutive and Sv e TC^^. Then Td^{6v^) = Td^{6v) € Of and 5v+ e vert^^V. By 



Lemma 
TDd. 
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there is a 6v' e TCV^ n Dd) such that Td^(6v') = Td^(Sv). Also Td-(6v) = Td-(6v') = 0, so 6v = 6v' e 



In a (continuous) nonholonomic system with linear constraints, the Lagrange two form is nonsingular on the 
distribution TCjj if and only if the Lagrangian is regular. For a DCLS, there is the following similar result in the 
holonomic case. 

Theorem 14. IfCV, d^,d^, L^, oii, £^) is holonomic then o^ is regular''' if and only if ciia^ '* nondegenerate on 'K2, . 

Proof Let v e Dd and 6v,6w e TC^^. Then Td-(6v) e U^, so there is a vector field V e U^ such that Td-{6v) = 
V{d^{v)). Similarly choose vector fields V, W, and W such that 

Td^(5v) = y(5+(v)), Td-{5w) = Wid'iw)), Td^(6w) = W{d^(w)). 

Then (5v = y+(v) + y-(v), 5w = W^{v) + W"(v). Also, 

w,^_^(y+ + y-,W+ + H^-) 



= -(y+ + y")(6i+/w+ + w-)) + (w+ + w")(0+,(y+ + y^)) + ^^([y"' + y^, w+ + h^^d 
= -(y+ + y")(cr^(w+ + W'T) + (w+ + w")(o-^(y+ + v-y) + crd{[v^ + y^, w+ + w"]+) 

= -V-{cTd{W^)) + {W-){(Td{V^)) - d<Td(V\ W+) 

= rfVrf(w, y) - cTcTdiv, w) 



oj^,{6v, 6w) = d^crd{v){Td-{6w), Td^{5v)) - d'=o-d(v){Td-(6v), Td^{5w)). 
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Suppose cTd is regular"^ and that a)aj{6v, 5w) - for all 5w € ("TC^),.. Choosing 6w e vert 'V gives 

for all 6q e (Df)g*(,,). Thus Td^(6v) = as cr^ is regular". Similarly Td'^(Sv) - since Od is regular^. Thus 6v - 0, 
proving that Wo-, is nondegenerate. The converse — that oj is regular'^ if ajj is nondegenerate on TC^ , follows by 
reversing this augment. n 

In the holonomic case, symplecticity and preservation of momentum are expected and they are recovered in the 
following two corollaries. 

Corollary 15. If F: Uf Q 'V ^ Vf Q ^ is an evolution of the holonomic DCLS {'V,d^ ,d^ ,(Td,T>ci,&d), then F 
is symplectic on "TCB, i.e. iCL is TF invariant and u)d(y){TF(6v),TF{6w)) — (jL>d(v){6v,6w) for all 5v,6w € (TCll )v,. 

Proof If suffices to verify the hypotheses of Theorem [9] with TC" = TC^^. If v € Dd and 6v e ('K^)v then 6v e TDd 
and Td^iSv) € {&d)v = {Df)a*(y). Since Dd is F invaiiant, TF{5v) e TDd- Also 

Td-{TF(6v)) = T{d- o F)(^v) = Td^{5v) e £)f 

so Td^{TF{6v)) e 'K^^ - TC^ . Furthermore /^■k* - because Lemma fl3l implies TCJ is involutive, and dad = by 
hypothesis. n 

Corollary 16. Suppose F: Uf C'V ^ Vf Q'V is an evolution of the holonomic DCLS 'V = CV, d^,d~, Od, Dd, &d), 
and suppose that a Lie group Q acts on 'Y and Q such that 

L d^ and d^ are equivariant; and 

2. Ur is Q invariant; and 

3. QQQDf. 

Then 'Y is a symmetric DCLS and F is momentum conserving. 

Proof. Invariance of 6^ follows from (&d)v - (0^)d*(v) and Items 1 and 2 From Item 3 the definition of "TC^ , and 
Lemma 
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we have a,!Dd £ "T^ £ TDd, so Dd is invariant and hence 'V is symmetric. By TheoremrTJ conservation 
of momentum follows from ^ d^(v) e (£),, for all v e Dd and all ^ G g, but this is implied by Item|3] n 

7 Lagrange-d'Alembert as a skew critical problem 

From (8) we recover some basic definitions: 

Definition 10. Let M and N be manifolds, a be a one-form on M, D be a C* distribution on M, and letg: M -^ N 
be a submersion. We call {a, D, g) a C'^ skew critical problem. A point m,. & M is a skew critical point of (a, D, g) at 

neN if 

a(mc)iv) = for all v 6 £)m, , 
g(mc) = n. 

A skew critical problem is called variational ifD is involutive. 
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Definition 11. Let nic be a skew critical point of (a, D, g). Define the bilinear form doaimc) : T„^,M x £),„ — > R by 

d£ia{mc){u, v) = {d(iva)(mc), u), 

where V is a (local) vector field with values in D such that V{mc) = v. The skew Hessian of a with respect to g and D 
is the bilinear form 

dD^goinic): k&xT„,^g x £)„,^. -^ R 
obtained by restriction ofdoainic). Define dD^gdinicf : kei T^^g — » £)*, by 

dD^ga(mJ'{u) = do,ga(mc)(u, ■)• 
A skew critical point nic of {a, D, g) is called nondegenerate if dD.gainic)^ is a linear isomorphism. 

We are using the term 'skew critical problem' in two slightly ways. First, it is any constrained critical point 
problem where the derivative is taken in directions that may be other than the constraint directions. Second, it is a 



problem within the technical meaning of Definition 10 The two are not really the same. For example, in Definition 10 
the purpose of the function g is to provide a parametrization of the critical points, rather than a constraint. It is not 
necessarily a natural constraint e.g. its level sets do not necessarily correspond to the leaves of £), in the case that D is 
integrable. 

If Dd is the level set of a submersive constraint function g^: 'V ^> V i.e. £)j - g^^ipo), then LdA^ may be cast 



as a skew critical problem within Definition 10 in a variety of ways. One of the ideas, though, of such skew critical 
problems, is that their critical points should be isolated on the level sets of the constraint function, and should be 
smoothly parametrized by values of the constraint function. So, the constraint function of the skew critical problem 
representing LdA^ should provide exactly the freedom required to fit the skew critical points to the constraint levels and 
to the initial conditions of the evolutions of LdA^. Failure of this would rule out (local) existence and uniqueness for 
discrete evolutions as a consequence of the natural persistence of (nondegenerate) skew critical points. The following 
is a formulation of LdA^ as a skew critical problem in the atomic N -2 case; the case for arbitrary A' is similar 

Definition 12. The associated skew critical problem to the DCLS i'V, oj, Dd, &d) is i^d, ^d> 8d), defined on Cd, and 
where 

gd-.Cd^'VxV, gdiv, v) = (v, gd(v)). 

Clearly, f^'(po) £ A/^ for any choice ^o = (v, po) such that ^^/(v) = /?o, and the solutions to (2^,^^,^;^) as v varies 
over Dd are exactly the solutions to LdA^. The justification for the particular choice of gd comes from the following. 

Theorem 17. All solution pairs of a regular'^ DCLS i'V, cTd, Dd, &d) ore nondegenerate skew critical points of the skew 
critical problem {'I.d,'Wd,gd)- 

Proof. Let (v^, Vc) be a solution pair. It is required to prove that 7'(,,^,,i^)Cy x 'V) 3 (6v, 6v) = if 

iSv, 6v) e ker dgd, and 

(25) 
d<Wj.gj^d{{Sv, 6v), {5w, 6w)) - for all {6w, 6w) e {^d)(v„v,)- 

Assuming (25]l, choose 6q e Wa+(,,,)(3, and extend Sq to a (local) vector field Xgq on Q, as in the development just after 
TheoremB Then X = (X^^ , X'^^ ) is a vector field in "Wd, so {6w, 5w) e (^d)(v^,i\) if ^w = X^^ (v^) and 6w = X^ (vc). 
Also, 

/x£rf(v, v) = iVcrj(v) + ix-Odiy). (26) 
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{6v,6v) e ksrdgd implies, from the definition of g^, that 6v - and 6v G TfJDd where Dd = g^\po) and po = gd{v)- 
Since the skew critical problem occurs in Cd, Td^idv) - Td'^{6v) - 0, so it is enough to show Td^(6v) - 0. The first 
term of Equation (p6l) differentiates to zero in the direction {6v, dv), and hence 



= d^^I.d{{6v, dv), (6w, div)) = d'^OdiVcXSq, 6q), 
where Sq - Td^idv) e ^^^(vertp 'V n T^Od)- Then Sq - 0, since 6q is arbitrary and the DCLS is regular^. n 

8 Concluding remarks 

Discretization is, in one view, the replacement of infinitesimal objects with finite, geometric ones, depending on what 
one wants to represent. In differential geometry this might mean the attachment of geometric objects to every point of a 
manifold. The ubiquitous Unear bundles of differential geometry become bundles of geometric shapes; the Unear fibers 
become sets on which most of the usual linear operations are absent. At /? = there is degeneration. Obtaining this 
limit requires desingularization, and, for coherent results along the whole of a continuous target, semiglobal analysis. 
For each hof a. discretization, we have a discrete analogue, which is a simpler, abstract construct, because it has not 
the burden of supporting the limit /; — > 0. 

Using curve segments to discretize tangent bundles is geometrically vivid. The abstract notion of a discrete tan- 
gent bundle {'V,d^ ,d^) (Definition |2]i of a configuration space (3, is an example of the clarity afforded by invariant 
differential geometry: 



TQ <y 



contmuous: 



(3 



discrete: / ^\ 

Q Q 



While they do not support linear operations, the discrete tangent bundles have other structures. The forward and 
backward discrete canonical projections result in decompositions because they split every fiber of T'V. 

To work Hamilton's principle directly on velocity phase space is to view the equation q(ty = v as a constraint. 
The discrete analogue of this constraint for a sequence v, e 'V is that successive curve segments attached to the v, 
join to make a continuous whole i.e. 5^(v,) = 5 (v,+i). That, together with the usual fixed endpoint constraints, and 
discrete action the sum of a discrete Lagrangian over v,, altogether define the discrete variational principle that gives 
the discrete evolution. What follows that is more-or-less a straight application of the philosophy of (.15. .19j . which 
extracts structural properties directly from variational principles. 

Practically, it is easier, and more direct, to generate curve segments rather than interpolate between the configu- 
rations (q^,q^) € Q. In (21) we derive, based on the discretizations of this article, numerical methods for explicitly 
constrained Lagrangian systems. The required curve segments may be generated using virtually any one step numer- 
ical integrator, and, automatically there follows a variational integrator of the same order. The current state of the 
art in geometric integrators for nonholonomic systems uses the MV discrete phase space (iTl fTOlfTTlfTTl l. It is future 
work to address construction of and the error analysis of nonholonomic variational integration algorithms using the 
discretizations of this article. 

The curve segments naturally shrink to points as /z — > 0. This results in a well defined and precise approach to the 
limit /z — > 0, of which Proposition [T] shows typical use. It is good to be respectful of this limit. In (|20] | we show that 
the error analysis of discrete holonomic variational integrators, which is also an issue of h ^ 0, depends on a subtle 
symmetry, and has sometimes been oversimplified. 
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The discrete phase spaces and the continuous ones are conceptually separated; there is no innate association be- 
tween discrete and continuous states, nor is there any unique configuration associated to a discrete velocity. This is 
an unnatural conceptual point and it has to be forcibly remembered, especially when, as is usual, the discrete phase 
space and the continuous one are the same, set theoretically. Since there is no innate association of continuous and 
discrete states, there is neither any association of discrete states with physical states. Such associations are inherently 
ambiguous if h > 0. 




For example, when projecting to configuration space, there are not just the two possibilities d^ and d^, but also 
TQ, any other point on the curve segment, and, if 'V = TQ c R'^, any convex combination of d^ and d^. The 
continuous tangent vector associated to curve segments is similarly ambiguous. Without further motivation, any of 
these choices are as good as any others. With motivation, such choices reflect the motivation, not the presence of a 
preeminent choice. Suppose one has a variational integrator, and some association TQ — » 'V. Conjugation by any 
structure preserving morphism, which is near to the identity to sufficiently high order in h, gives another association. 
The conjugated and original variational integrators are equivalent; the implied change in association of physical state 
to discrete states is not relevant. And, in any case, even though the discrete and continuous phase spaces may be the 
same set theoretically, the structures of the discretization do not usually have the same functional form as those of the 
continuous system. So what is the justification for identifying the phase spaces? 

If a discretization of a structured model is not structure preserving, then it is subordinate and its states are slaved to 
continuous states. If such a discretization is structure preserving and has equal stature to a continuous model, then its 
states correspond to continuous ones only ambiguously. Any specific identification of continuous and discrete states, 
such as e.g. the identification of the Lagrange multipUer of the discrete first order constraint with the continuous 
momentum of the continuous Lagrangian, can only be admitted a status similar to a possibly convenient special 
coordinate system. Of course many such coordinates exist in geometry; but they are, at most, important and useful 
intermediaries which cannot properly be elevated to the stature of necessity or structural centrality. 
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